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DIRECTIONS AND PROJECTIVE SHAPES^ 

By Kanti V. Mardia and Vic Patrangenaru 

University of Leeds and Texas Tech University 

This paper deals with projective shape analysis, which is a study 
of finite configurations of points modulo projective transformations. 
The topic has various applications in machine vision. We introduce 
a convenient projective shape space, as well as an appropriate co- 
ordinate system for this shape space. For generic configurations of 
k points in m dimensions, the resulting projective shape space is 
identified as a product of k — m — 2 copies of axial spaces RP'". 
This identification leads to the need for developing multivariate di- 
rectional and multivariate axial analysis and we propose parametric 
models, as well as nonparametric methods, for these areas. In partic- 
ular, we investigate the Frechet extrinsic mean for the multivariate 
axial case. Asymptotic distributions of the appropriate parametric 
and nonparametric tests are derived. We illustrate our methodology 
with examples from machine vision. 

1. Introduction. Consider a configuration of points in M*" . "Sliape" deals 
with the residual structure of this configuration when certain transforma- 
tions are filtered out. More specifically, the shape of a configuration consists 
of its equivalence class under a group of transformations. Important groups 
for machine vision are the similarity group, the affine group and the projec- 
tive group. Here the group action describes the way in which an image is 
captured. For instance, if two different images of the same scene are obtained 
using a pinhole camera, the corresponding transformation between the two 
images is the composition of two central projections, which is a projective 
transformation. If the two central projections can be approximated by par- 
allel projections, which is the case of remote views of the same planar scene, 
the projective transformation can be approximated by an affine transforma- 
tion. Further, if these parallel projections are orthogonal projections on the 
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plane of the camera, this affine transformation can be approximated by a 
similarity transformation. Therefore, the relationships between these shapes 
are as follows: if two configurations have the same similarity shape, then 
they automatically have the same affine shape; if they have the same afhne 
shape, they will have the same projective shape. For example, two squares of 
different sizes have the same similarity, afhne and projective shape, whereas 
a square and a rectangle have the same affine and projective shape but not 
the same similarity shape. On the other hand, a square and a kite have the 
same projective shape but not the same affine shape. 

The word "shape" often refers in statistics to similarity shape where only 
the effects of translation, scale and rotation have been filtered out (see, e.g., 
[4, 5]). In recent years, substantial progress has been made in similarity 
shape analysis, since appropriate shape space (e.g., Kendall's space) and 
shape coordinates (e.g., Bookstein coordinates) have been available. A sim- 
ple example of Bookstein coordinates is for the shape of a triangle where 
the shape coordinates are obtained after taking one of the vertices as the 
origin and rotating the triangle so that the base of the triangle lies on the 
rr-axis, and then rescaling the base to the unit size. The motivation behind 
such coordinate systems is similar to those in directional statistics, where to 
analyze spherical data one requires a coordinate system such as longitude 
and latitude (see, e.g., [18]). 

Affine shape has also received some attention; see, for example, [10, 24]. 
Sparr [24] has shown that the space of affine shapes is a Grassmann manifold. 
For affine shape in 2-D, we can obtain shape coordinates by using three 
points that determine the direction and the origin of the axes, and the unit 
length between the points on each of these two axes. 

Progress in projective shape analysis has been somewhat slow by not 
having a convenient shape space, though considerable work has appeared 
on projective invariants (see, e.g., [11, 12, 20]). We propose a convenient 
projective shape space, as well as an appropriate coordinate system for this 
shape space. 

The plan of the paper is as follows. In Section 2 we propose our approach 
in "projective shape analysis," which has its basis on the idea of constructing 
a projective frame selected from the points of a generic configuration. The 
resulting projective shape space is a product oi k — m — 2 copies of axial 
spaces MP™-. This axial representation leads to various questions in mul- 
tivariate directional statistics. To address these questions, in Section 3 we 
first discuss some parametric models, especially for the multivariate circular 
case. As a starting point, we consider certain von Mises circular distribu- 
tions. These provide good approximations to marginal distributions on the 
circle of cross-ratios with normal errors at landmarks (i.e., offset projective 
distributions), as argued through simulations in [11]. We then treat the case 
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of concentrated data by using a directional representation in a tangent space. 
In particular, the procedure is illustrated by constructing a two-sample test. 

In Section 4 we consider estimation of certain means, both asymptotically 
and through bootstrap methods. In particular, we treat the multivariate ax- 
ial case, highlighting the extrinsic mean; for m = 1, the circular extrinsic 
mean is well studied (see, e.g., [18]) and is generally referred to as mean di- 
rection. Theorem 4.1 provides asymptotic distributions of certain test statis- 
tics required for the estimation of the extrinsic mean of projective shapes 
for any m, and Corollary 4.1 provides some bootstrap approximations for 
these asymptotic distributions. In Section 4 we also provide a two-sample 
test for extrinsic means of projective shape through a bootstrapping result. 

In Section 5 we illustrate our methodology through three examples in 
object recognition. The first two examples concern building recognition and 
use circular and univariate spherical statistics, whereas the third example 
is about face recognition and uses bivariate spherical statistics. Of course, 
the realm of applications is much wider, covering other types of multivariate 
axial data. In Section 6 we present a strategy for general statistical shape 
analysis where the shapes are regarded as orbits of certain Lie group actions 
on a direct product of a number of copies of a manifold. 

2. The projective shape space. Recall that the real projective space in 
m dimensions, MP™, is the set of axes going through the origin of R™~^^. If 
X = (X\...,X™+i) gM"+i\{0}, then 

[X] = [X^:X^:...: X"'+^] = {AX, A / 0} 

is a projective point in MP™; we will reserve the notation [•] for the projective 
points throughout. In an alternative description, a point p G MP™ is given 
hy p = [z^ : z'^ : . . . : z™"*""^], where 

A linear variety v of dimension d is given hy v = {[x], x £ V \ 0} , where V is 
a (d + l)-dimensional vector subspace of M™"^^. In particular, a projective 
line / is a set associated with a vector plane V in M™"*"^, I = {[x], x £ V \ 0} . 
A number of points in MP™ are collinear if they lie on a projective line. 

The Euclidean space M™ can be embedded in MP™, preserving collinear- 
ity. Such a standard affine embedding, missing only a hyperplane at infinity, 
is 

x=(x\...,x™)^[2;i:... :x™:l]. 
This leads to the notion of affine or inhomogeneous coordinates of a point 
p= [X] = [X^:... :X™:X™+1], X™+ V 0, 
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to be defined as 

as opposed to the homogeneous coordinates of p, {X^, . . . ,X"^^^), wliich are 
defined up to a multiplicative constant only. However, the coordinates of 
interest in projective shape analysis are neither affine nor homogeneous. We 
need coordinates that are invariant with respect to the group of projective 
(general linear) transformations PGL{m). A projective transformation a of 
T^pm jg (Jgfined in terms of an (m + 1) x [m + 1) nonsingular matrix A 
e GL{m + l,M) by 

a{[X^:...: X^+i]) = [A{X^ , X'^+Y]- 

The linear span of a subset of MP'" is the smallest linear variety containing 
that subset. Note that k points in MP*" with k>m + 2 are in general position 
if their linear span is MP™ . 

Definition 2.1. A projective frame in MP™ is an ordered system of 
m + 2 points in general position. 

In computer vision, a projective frame is called a projective basis by 
some authors (Heyden [13], page 8; Faugeras and Luong [6], page 81). Let 
(ei, . . . , Cm+i) be the standard basis of M'""'"^. The standard projective frame 

is ([ei], . . . , [em,+i], [ei H h em+i]). The last point of this frame is referred 

to as the unit point. 

Proposition 2.1. Given two projective frames vri = (pi^i, . . . ,^1^^4-2) 
and 7r2 = ip2,i, ■ ■ ■ ,P2,m+2), there is a unique (3 G PGL{m) with P{pij) = P2,j, 

i = l,2,...,m + 2. 

A proof follows on noting that, given a projective frame vr = (pi, . . . ,pm+2), 
there is a unique a E PGL[m) with 

(2.1) a{[ej\)=pj, i = l,...,m + l, a([ei H V e^n+i]) = Pm.+2- 

Remark 2.1. If k > m + 2, we consider the set JFC^ consisting of 
configurations of points {pi, ■ ■ ■ ,Pk) for which there is a subset of indices 

ii < ■ ■ ■ < im+2 such that (pjj, . . . ,Pi^^2) is a projective frame. 

From Proposition 2.1, J-'C^ is an invariant generic subset of C^. It can 
be shown by considering, for example, m = l,k = 4, that the corresponding 
shape space is a manifold. Let us denote the projective shape of ipi,P2,P3,P'i) & 
TCf by a. 
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Any projective shape is in one of the sets t/123, ?7i24; f^i34 or C/234, where, 
for i < j < k, 

(2.2) Uijk = {cr\{pi,Pj,Pk) is a projective frame}. 

Assume Pr = [x^ : 1], t = 1, 2, 3, 4. Then, from (2.2) we may define the charts 
i'ijk by tpijkiiy) = c{pi,Pj,pk,Pi), where c(-) is a cross-ratio defined by 

c{pi,Pj,Pk,Pl) = {{Xi - Xk){xi - Xj)}/{{Xi - Xj){xi - Xk)} 

and {i,j,k,l} = {1,2,3,4}. On permuting indices, we find that 

/ ^ / 1 / / "^134 



V123 ■0134 - 1 

Since the transition maps between these charts are differentiable, we con- 
clude that the projective shape space associated with TCf is a 1-D manifold. 
Projective shape spaces associated with TC^ are expected to be more com- 
plicated. However, we prefer to restrict ourselves to a subset of generic con- 
figurations, such that the corresponding shape space has a natural structure 
of symmetric space (see Proposition 2.3), and for which the computations 
can be carried out using standard statistical packages. 



Definition 2.2. The axis of a point p G RP"^ with respect to a projec- 
tive frame vr = (pi, . . . ,Pm+2) is defined as p^ = a~^{p), where a S PGL{m) 
is given by (2.1). A geometric interpretation oi p'^ is given below. 

Assume fm+2 is a representative of Pm+2- Since (pi, . . . ,pm+2) 

are in general position, fm+2 can be written in a unique way as a sum 

fm+2 = /i H h fm+1, where [fj] =pj, for j = 1, . . . , m + 1. The vectors 

fi, ■ ■ ■ , fm+i form a basis of M™"'"^, and let / € 1^*"+^ be a representative 
of p. Then we denote Y^,...,Y"^^^ as the components of / with respect 
to this basis. Note that since the selection of fm+2 and of / is unique up 
to a multiplicative constant, the projective point [Y^ : . . . : is well 

defined in MP™ and p'^ = [Y^ : . . . : y™+^]. This representation of projective 
coordinates is displayed in Figure 1 for m = 1. Figure 1(a) constructs the 
coordinates li,l2 of / with respect to the frame ([/i], [/2], [/a]); Figure 1(b) 
shows the corresponding projective point [Yi :l2]- 

Let us assume that xi, . . . , Xm+2 are points in general position and let x = 
{x^, . . . , x"^)'^ be an arbitrary point in M™. In this notation the axis of x with 
respect to the projective frame associated with m + 2 points xi, . . . ,Xm+2 
is the same as the axis of p = [x^ : . . . : x™ : 1] with respect to (pi, . . . ,Pm+2)- 
Using the above geometric interpretation, we determine the axis of x in the 
following proposition. 
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Proposition 2.2. The projective axis of a point x is given by 



(2 


[z^{x):z^{x)- 


..:z™+i(x)], 


ti}h PTP 






(2.4) 


z^ix) =y^{x)/\\y{x)\\, 


j = l,...,m+ 1, 




yi[x) = V^{x)/v^{Xm+2), 




(2.5) 


y{xf = {y\x),...,y^+' 


ix)), 




z{x) = {z\x),...,z'^+^ 


{x)f, Mx)f 


and 






(2.6) 


v{x) = {v\x),...,v^+\x)f = U-\{x)., 


with the {m - 


h 1) X [m + 1) matrix 




(2.7) 


Um = [p(xi), . . 


■,P{Xm+l)], 


where p{x) = 







Proof. We note that there is a unique vector (3 G M'"+\ j3 ={(5i,. . . ,/3m+i), 
such that (5lp{xl)^ (3ni+ip{xm+i) =p{xm+2)- Namely, UmP = p{xm+2) 



OI 13 = U^^p{Xm+2) = v{Xm+2), SO that 

(2.8) (3i = v\x„,+2), i = l,...,m + l. 
Let A be the matrix 

(2.9) A = Umdmg{(3). 



If (ei, . . . , Cm+i) is the standard basis for IR™+-^, then Aei = j3ip{xi), i = 
1, . . . ,m + 1, and A{ei + • • • + em+i) = p{xm+2)- This means that the pro- 
jective transformation a, given by a ([2;]) = [Ax], has the properties that 

a[ei] = [p{xi)],i = l,...,m + l, and Q[ei H h Cm+i] = [p(xm+2)]- Hence, 

from Definition 2.2 it now foHows that the homogeneous projective coordi- 
nates of \p{x)\ are given by 

(2.10) y{x)=A-^p{x). 

From (2.8) and (2.9) we have 

y{x) = Aia.g(^,...,-^—\u~^p{x) 

= diag (^j-^ r, . . . , ^, 1/ 7] U-^p{x). 

\v'^{Xm.+2) i;™+^(x„,+2)/ 

Hence, using the definition of v{x) given by (2.6), (2.5) fohows. □ 
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Remark 2.2. We will say that y{x) are the projective coordinates of x 
with respect to the projective frame generated by (xi, . . . ,Xm+2), and note 
that [z{x)] defined by (2.3) is the corresponding point on MP*". 

Remark 2.3. Note that we have v{xi) = ei,i = 1, . . . ,m + 1. 

Let G{k,m) denote the set of all ordered systems of k points (pi, . . . ,pk) 
for which (pi, . . . ,Pm+2) is a projective frame, k > m + 2. PGL{m) acts on 
G{k,m) by a{pi,...,pk) = {api, . . . ,apk). 

Definition 2.3. The projective shape space -PS^ or space oi projective 
k-ads in MP™ is the quotient G{k,m)/ PGL{m). 

Proposition 2.3. PS^ is a manifold dijjeomorphic with (MP™)'?, where 
q = k — m — 2. 

Proof. We define F : PT.'^ (MP™)^ by 

P((pi, . . . ,pk) modPGL{m)) = (p^^+g, . . . ,pl), 

(2.11) 

TT = {pi, . . . ,p.„,+2),pj = [z\x,) : . . . :z™+^(xi)], 

where z{xi) = {z^{xi), . . . , z"^'^^{xi))'^ , \\z{xi)\\ = l,i = m + 3, . . . ,k, and z{-) 
is given by (2.3). 

The mapping P is a well-defined diffeomorphism between PS^ and a 
product of real projective spaces. 

Note that (2.11) defines an axial representation of the projective shape. In 
this representation, for m = 1 we can write (2.11) as pj = [e*'^-'], where (pj is 
the angle of an axis. Then doubling takes us to an oriented direction e*^^ € 

. Further, we assume that xi,X2,X3 yield a projective frame vr and [x: 1] 
is an arbitrary point on the projective line. Following the above algorithm 
for projective coordinates, from (2.5) and (2.6) we get 

X — X2 2/ \ xi — X 



1/ \ -^2 2/ \ 

V [X) = , V (x) 

Xl —X2 Xl - X2 

yi(x) = v^{x)/v^{x3,), y^{x) =v^{x)/v^{x-i). 
Thus, from (2.4) we have 

1/ N y^jx) 2/ N y^jx) 

^ ^""^ W{x)Y + {y^{x)YY/^' ^ {(2/i(x))2 + (y2(x))2}i/2 
or, equivalently, 

(2.12) z\x) = -^^—^^^, z'ix) "^^""^ 



{X'i-X2)d{x) (xi-X3)d(x) 
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where 

LVX3-X2/ \Xl-X-iJ J 

That is, we can write 

(2.13) =[z^{x):z^{x)], z^{x)=cos(l){x),z^{x)=s\n(t){x). 

This representation of projective coordinates is displayed in Figure 2. Note 
that on ehminating x from y^{x) and y'^{x), we get (3:3 — X2)y^{x) + (xi — 
X2,)y'^{x) = xi— X2. Since xi^ X2^ x^, this equation of a hne in the plane 
{y^{x),y'^{x)) confirms that the angle (t){x) lies between and vr. □ 



Example 2.1. We now illustrate our coordinate system for a problem 
studied in machine vision by Heyden [13], pages 33-34. He has considered a 
configuration of five points from two images of a rectangular sheet of paper. 
The five points form a cross. For the first image the coordinates are 



xi - 

Xi ■ 



(69,53), 
(69,430), 



X2 

x-> 



(591,33), 
(344,322). 



X3 = (626,402), 



The first four points are the corners and the last one is the center of the 
cross. Here m = 2 and, from (2.7) we have, on registering with respect to 
the frame x\, X2 and X3: 



69 
53 
1 



591 
33 
1 



626 
402 
1 



Hence, from (2.6) we find that v{xif = (1.0683, -1.0862, 1.0180) and ?;(x5)^ = 
(0.5057,0.0095,0.4848). Thus, from (2.4) and (2.5) we get the following 
spherical representation of the projective shape: 

z(x5) = [0.7050 : -0.0131 : 0.7092] = zi, say. 

Similarly, for the second image we have xi = (334,69), X2 = (732,290), 
X3 = (428,504), X4 = (43,200), X5 = (373,243), leading to 

z{x^) = [0.7074 : -0.0060 : 0.7067] = Z2, say. 

We will return to these coordinates in Example 4.1. 



We now describe the alternative representation of projective shape due to 
Goodall and Mardia [11] and show the connection between the two represen- 
tations. In their representation the projective shape of (pi, . . . ,pk) G G{k, m) 
is uniquely determined by its projective invariants. In fact, the projective co- 
ordinates, with respect to (pi, . . . ,Pm+2) and the projective invariants {iji)-, 
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(a) 




Fig. 2. Projective coordinate of a point on a Euclidean line (a) shows the embedding of 
an afpne line in the projective line and the projective points corresponding to the points on 
the real line; (b) shows the projective coordinate of the point p — [x : 1] = [x] with respect 
to the projective frame pi = [a;i],p2 = [x2],P3 ~ [xs] and the corresponding angle of this 
projective coordinate, 4>{x), which should be doubled to get the point 9{x) on the circle. 
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J = 1, . . . , m; i = m + 3, . . . , fc, determine each other by the invariant repre- 
sentation 

(2.14) p7 = [iii :i2j : • ■ • :'-mi : 1], 

so that the ith projective coordinate is i.^j. When m = 1,2, let us consider 
their invariant representation. For m = l, from (2.12) and (2.13) we have 



(2.15) p'' = [x:ir 



X2 Xi — X 



[c{xi,X2,X3,x) : 1]. 



LX3-X2 Xi-X3_ 

This equation shows that the projective coordinate of [x : 1] with respect to vr 
in our representation [viz., equation (2.13)] and the cross-ratio c{xi,X2,X3,x) 
determine each other. Here x could be any of the k — 3 points. For 171 = 2 we 
now assume that xi,X2, ■ ■ ■ ,Xk are k points in MP^, k> A. Let zf = z^{xi),i = 
5, . . . , fc. Then the invariants lu and L2i in (2.14) are the cross-ratios 

(2.16) i-ii = -^, l-2i = -^, 

zf zf 

where lu and L2i are the cross-ratios determined on a transversal by the pen- 
cil of lines joining the point xi to the points X2,X3,X4, Xi and by the pencil of 
lines joining the point X2 to the points xi,X3,X4,Xi, respectively. There are 
some parallel ideas between coordinates for similarity shape, affine shape and 
the projective shape considered here. We now give registration frames for 
the three shapes in 2-D. Let us consider planar similarity shape. The shape 
can be registered using two points; for example, we can use the registration 
frame with the points (0, 0) and (1, 0). Figure 3(a) shows the original configu- 
ration for = 4 and Figure 3(b) shows its Bookstein coordinates. Figure 3(c) 
is discussed below. In the case of affine shape ("intermediate" between sim- 
ilarity shape and projective shape), we can choose the registration frame 
consisting of the three points (0,0), (1,0) and (0,1). In projective shape in 
2-D, using homogeneous coordinates, we can select the registration frame 
consisting of the points (0, 0), (1, 0), (0, 1) and (1,1). In inhomogeneous co- 
ordinates, the registration frame corresponds to the points (1,0,0), (0, 1,0), 
(0,0,1) and (1,1,1) in 3-D. The steps in projective shape registration for 
1-D and 2-D are as follows: 



Case m = l. 

1.0. Start with a configuration of k points. 

1.1. Register each of the last k — 3 points with respect to first three points 
leading to (2.6). 

1.2. Transform these registered points by (2.4), leading to one point on 
the Cartesian product MP^ x • • • x RP^ of /c — 3 copies of MP^, that is, the 
projective shape of a linear configuration of k points is equivalent to fc — 3 
axes in M?. 
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(a) 



CM - 



O - 



X 



CM 



T 
-1 



T" 




"T" 
1 

(b) 



"T 
2 



T 

3 



in 
d 

o 
d 




-1.0 



0.0 0.5 1.0 



Fig. 3. (a) Similarity shape: four points in 2-D with the two base points marked by "x," 
and the other by "o." (b) Bookstein registration for Figure 3(a) with respect to the frame 
(-i,0) and(i,0). 



1.3. Transform these A; — 3 axes to directions by doubling the angles. Thus, 
we get an observation on a (A: — 3) -dimensional torus. 

Figure 3(c) gives a schematic diagram of these first two steps for k = 5. 
Case m = 2. 




Torus 

Fig. 3. (Continued) . (c) Projective shape: schematic diagram for five points in and 
their registration steps. 



2.0. Start with a configuration of k points. 

2.1. Register tlie projective coordinates of the last k — A points with re- 
spect to the first four points, say. 

2.2. Transform these registered points leading to one point on the Carte- 
sian product MP^ X • • • X MP^ of k — A copies of MP^, that is, the projective 
shape of a planar configuration of k points is equivalent to A; — 4 axes in M^. 

These schematic constructions are displayed in Figure 4 for m = 2 and 
k = 6. Note that unlike the case m = 1, for m = 2, MP^ cannot be visualized 
in three dimensions, as we need at least four dimensions to immerse the 
real projective plane into a Euclidean space without double points (see [25], 
pages I- 9-1-13). A rigorous geometric construction for m = 1 and k = 4 has 
already been given in Figures 1 and 2. 

In the subsequent discussion, we will work on the axial z-coordinates de- 
rived from the projective coordinates x with respect to the frame {xi, . . . , 2:^+2) • 
From now on the z-coordinates will be written as [X] = {[xi], . . . , [xq]), which 
corresponds to an (m + 1) x q matrix. In the case of the directional repre- 
sentation (for concentrated data), we will write X = (xi, . . . ,Xq) when there 
is no ambiguity. 
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(a) 



(b) 




Axis in 3-D 



Fig. 4. (a) Six points in and (b) a schematic representation of their projective coor- 
dinates. 
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3. Spherical distributions for projective shape. We have shown in Propo- 
sition 2.3 that the projective shape of k landmarks in m dimensions can be 
represented by q axial variables given by (2.3). Thus, we could use appro- 
priate axial distributions developed in directional statistics as models for 
projective shape (see, e.g., [18]). Simulation studies performed by Goodall 
and Mardia [11] suggest that von Mises distributions are appropriate approx- 
imations to the angular version of the cross-ratios under isotropic normal 
variation at landmarks; see their chi-square plots in Figures 9 and 10. Also, 
various applications are given in that paper. That is, for m = 1 and k = 4, 
the angle 9 = 2(j) can be considered to have the von Mises distribution with 
probability density function 

/(6l;/i,K) = {27r/o(K)}"^exp{Kcos(6l-/i)}, 6* G (0, 27r], k > 0, 

where /i G (0, 2tt\ is the mean direction, k is the concentration parameter 
and is the modified Bessel function of the first kind and order zero. 

In general, for m = l and any number of landmarks k, we require a mul- 
tivariate von Mises distribution on the torus (after doubling the angles). 
Mardia [17] proposed the following family of bivariate von Mises densities: 

const. X exp{Ki cos 9i + k,2 cos 02 + cos 9i cos 02 

+ K4 COS 9i sin 02 + K5 sin 9i cos 02 + kq sin 9i sin 02 } . 

The distribution could be generalized by mapping 9\ ^ 0i — ^ and 02 — > 
02 — u. A. particular density of interest is proportional to 

exp{Ki COS{01 — /U) + K2 COS(02 " '^) " '^S COs(0l — ^ — 02 + l^)}, 

where — vr < 0i,02 < vr, — vr < /i, < vr, ki > 0, K2>0 and ki > K3 > 0, K2 ^ 
^3 > 0. The density has the advantage that the parameters here have no 
redundancy for large concentration since the distribution tends to a full 
bivariate normal distribution. For another member of this family, see [23]. 
A multivariate extension of the distribution for variables 0i,. . . ,0q can be 
written down with density proportional to 

explSfls cos 0s + Tibg sin 0s + Sa^t cos 0s cos 0t 

(3.1) 

+ S6st cos 0s sm0t + Scst sin^^ sm0t}, 

where Ugs = bss = Css = 0,bst ^hs, and s,t = l,...,q. 

For concentrated projective shape data of configurations of k points in 
general position {xi, . . . ,Xk) where (xi, . . . ,Xm+2) yields a projective frame 
in W^, one may simply consider a multivariate directional representation 
{z{xm+3), ■ ■ ■ , z{xf:)), where z{xi) € 5"* are defined by (2.3). If there is no 
ambiguity, we will denote the vector {z{xm+3), • • • , z{xk)) as (xi, . . . , Xg). A 
projective shape is represented as a point in a direct product of q copies of 
5™. We now consider models on (5"")*^. 
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Distributions in the tangent space. For m = 1, let 7^ be the mean direc- 
tion of ^s, s = 1, . . . ,q. Then for 7^ = 0, a^j = bgt = 0, bg = and large Og, it 
can be shown that (sin^i, . . . ,s'm6q) is A^(0,S), where S is the "asymptotic" 
covariance which depends on the population parameters. If the angle 7^ is 
not zero, then we have approximately 

(3.2) (sin(0i - 71), . . . ,sin(0, - 7,))^ ~ N{0, S). 

This approximation can be extended to spherical variables for m > 1. Let 
X = (xi, . . . , Xq) be an (m + 1) x q random matrix with Xg G 5™. Note that 
these x's are not to be confused with the notation elsewhere in the paper. 
Suppose that fig is the mean directional vector of Xs,s = 1, . . . ,q, and let 
fj, = {fii, . . . ,fiq) denote the population mean directional matrix of X. Then 
define the spherical tangent coordinates of Xg by 

(3.3) Vs = (I - HstJ''^)xs, s=l,...,q. 

For the circular case with = (sin 0^, cos 0^) and fij = (sin7s, cos7s), we 
have Vs = sm{6s — 7s)es, where is a unit tangent vector at Hs, s = 1, . . . ,q. 
Then for concentrated data, it is plausible to extend (3.2) to the joint distri- 
bution ol vi, . . . ,Vq, which has a multivariate normal with zero mean vector 
and some covariance matrix. Such tangent approximations are commonly 
used in shape analysis (see, e.g., [5], Chapter 7) and found to be very ef- 
fective in practice. We now give a few results without any proof since their 
proofs are similar to those in shape analysis. 

Let Xi, i = 1, . . . ,n,he a random sample from the population with random 
matrix X, where Xi = (xj^i, . . . , Xi^q). We estimate the population multivari- 
ate mean directional matrix /i by the sample mean directional matrix fi = 
(/ii, . . . , fiq), where /ij, i = 1, . . . , g, are the standard spherical sample mean 
directions. Let Vi^g = [I — iJ-sfj''^]xi,s,s = 1, . . . ,q. Further, let v and 5 be, re- 
spectively, the sample mean and covariance matrix of Vi = {vJi, . . . , vjq)^ , i = 
l,...,n. Then the Mahalanobis distance squared = v^S~v has rank 
M = mq, where S~ is the Moore-Penrose generalized inverse of S. 

Proposition 3.1. For concentrated data the approximate distribution 
of Hotelling^s statistic is given by 

T = — — — D r^FM,n-M, 

and the asymptotic distribution of is xlf; where M = mq. 

Tangent space inference. Using the above strategy, we now construct a 
two-sample Hotelling's test. Let Xii,i = 1, . . . , rei, and X2j,j = 1, . . . , n2, 
be two independent samples, where Xu and X2j are two (m + 1) x q matrices 
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where each column lies in S*™. Suppose that /ii and ^2 are the respective 
mean population matrices. We wish to test the hypotheses 

Ho:fj.i=fi2 vs. i7i:/xi/^2- 

Let fi be the matrix of the combined (sample) mean directions given by 
(All •••)/"<?)• Let 

(3.4) Vi^s = [I - ft-sP-Jjxii^s, Wj^s=[I - fLsfi^]x2j,s, 

where i = 1,. . . ,ni, j = 1, . . . ,7t-2,s= 1, . . . ,g, andXu = {xu^i,.. .,xu^g), X2j = 
(x2j,i, . . . ,X2j^q)- Assuming that these two independent samples are from the 
normal populations in this tangent space with the same covariance matrix, 
we find that the Mahalanobis distance squared between v and w is 

D^ = {v-wfS-{v-w), 

where v is the mq x 1 vector of the means of {vfi, . . . , vj^'^ , i = 1, . . . , ni. 
The mean vector iu is similarly defined for the second sample. Further, S~ is 
the Moore-Penrose generalized inverse oi S = {niSi + n,2S'2)/("-i + n2 — 2), 
where and ^2 are the sample covariance matrices. Note that the rank of 
S is M. 

Proposition 3.2. For concentrated data the two-sample Hotelling^ s 
statistic is given by 

^3 rp2 ^ nin2(ni+n2-M-l)£)^ 

(ni+n2)(ni+n2-2)M ' 

and under Ho, is approximately distributed as FM,ni+n2-M-i- ^ general 
distribution for nonconcentrated axial data could be taken as a multivariate 
Bingham distribution. 

4. An extrinsic mean. Here, we define an appropriate location param- 
eter for a probability distribution on -PS^. Then for a random sample 
(Xi, . . . ,Xn) from a given probability distribution Q, we find a consistent 
estimator for the population location and derive its asymptotic distribution. 
We assume that a distance p on PS^ is specified. Through this distance p 
it is possible to define an index of dispersion. Given a probability measure 
Q on -PS,^, following the general treatment of Frechet [9] (see also [15]), we 
define, for y G -PS^, the function 

(4.1) TQ{y)=E[p\X,y)]= f p\x,y)Q{dx). 

Assume there is a unique y € such that inf^gp^fe -^q(V') = ^Qiv)] such 

a y is said to be the Frechet population mean, y := pyr. 
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Remark 4.1. For any probability measure Q on M the mean is always 
unique. In general, this is true for a probability measure on -PS^j but there 
are some exceptions. For example, for the uniform distribution on = PT,f 
with chord distance p defined by 

p'^{e'^,e'^) = l-cos{e-i>), 

we have Fqd/) = 27r for any point y on S^; thus y is not unique in this case. 

Let Yi,Y2, . . . ,Yn be independent and identically distributed random vari- 
ables with probability measure Q and let Qn be the empirical probability 
measure 

(4.2) q^ = 1.(Sy,+--- + 6yJ. 

The Frechet sample mean set is the set fijr = {y € -PS^, [y) = mlFp^ (y)}- 
If /ijF has a unique element, this element is called the Frechet sample mean 
and is labelled Yjr. Further, Ziezold [29] has established the strong consis- 
tency of the Frechet sample mean set on a compact metric space. Hence, 
from this result it follows that if /xjf exists, then any measurable choice y 
from fijr is a strongly consistent estimator of pj^. If the mapping 

is an embedding into a Euclidean space and we use the chord distance p, 
p{Pi,P2) = \\j{pi) - 3{P2)\\, Pi,P2 e PS^, 

then the Frechet mean is called the extrinsic mean and we denote it by p, ej , 
or write simply pE when j is known. One of the main features required in 
selecting an embedding is that the resulting mean is easily computable. 

First, we consider the case of = 1 in our formulation, which has already 
been studied in directional statistics (see [18, 21]). Note that for m = 1 the 
extrinsic mean of Q for a population of projective shapes corresponds to the 
standard circular mean direction (see, e.g., [18], pages 29-30). 

For m > 1, by Proposition 2.2 the space is identified with the 

axial space MP™. We consider the embedding j of MP™ into S{m + 1), the 
space of symmetric matrices [16] given for the directional representation 
[x] = {±x, \\x\\ = 1} by 

(4.3) j{[x])=xx'^. 

Here the Euclidean norm of a matrix A G S{m + 1) is given by = 
tr^^'^, that is, A is an (m+ 1) x (m+ 1) symmetric matrix, and if A = j{[x]) 
with ||x|| = 1, then \\A\\ = 1,A>0 and rank A = 1. 

Let [X], ||X|| = 1 be a random vector in MP™. Then from [2] it follows that 
the extrinsic population mean exists if the largest eigenvalue of E^XX"^) 
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is simple (i.e., has multiplicity one). In this case fiE = [7]) where 7 is an 
eigenvector of E{XX^) corresponding to the largest eigenvalue, with ||7|| = 
1. Moreover, if [Xr], (||^r|| =1); = is a random sample from a 

probability measure Q on MP™ and the extrinsic mean fiE Q exists, then 
the extrinsic sample mean [X]^ is a strongly consistent estimator of ^e{Q)- 
Note that when it exists, [X]^ is given by 

(4.4) W]e = [^1 

where m is a unit eigenvector of ^ X]r=i XrXj^ corresponding to the largest 
eigenvalue. It may be noted that, in this case, [X]^ is also the maximum 
likelihood estimator (MLE) for the mean of a Bingham distribution ([16, 
21]) and for the mean of the Dimroth- Watson distribution, whose density 
function at [x] is proportional to exjp{k{fi • x)^), where A: is a constant. For 
these or more general parametric families, MLE asymptotics or bootstrap 
methods [7] are commonly used. 

Example 4.1. We find that the extrinsic sample mean shape of the two 
projective shapes given in Example 2.1 is given by 

lij^ = [0.7062 : -0.0095 : 0.7080]. 

Since n = 2, we have [z]^ = [z]e = [{zi + Z2)/2]. Heyden ([13], page 34) ob- 
tained the reconstructed point in our spherical coordinates as [0.7070 : —0.0071 : 0.7071] 
by using a deterministic method. The two answers are very similar, though 
derived from two different methods. 

The main argument for using the embedding j in (4.3) is that, for this 
embedding, the extrinsic mean is easily computable via statistical packages 
with routines to carry out eigenanalysis. Another advantage is that the map- 
ping j is equivariant as shown in [16], and leads to the following multivariate 
extension to projective shapes. 

If we use the axial representation of projective shapes, for A; > m -|- 3 or 
g > 1, we can define an embedding j'^ of (MP™)'^ into {S{m + 1))*? in terms 
of j: 

(4.5) jk{[xi],- ■ ■ , [xg]) = {j[xi],. . . ,j[xg]), 

where Xs £ M™'^"^, \\xs\\ = 1, s = 1, . . . , g. Again, it can be shown that if the 
largest eigenvalues of each of the q matrix components of E{jk{Q)) are 
simple, then the extrinsic mean fj,ji_(Q) exists and is given by 



(4.6) fi := fi,,{Q) = ([7i(m + 1)], . . . , hg{m + 1)]), 
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where 7s(m + 1) is a unit eigenvector corresponding to the largest eigenvalue 
of the sth component of E[jj^[Q)). If l^,r = 1, . . . ,n, is a random sample 
from Q, then in the axial representation 

(4.7) y, = ([X,,i],...,[X,,J), ||X,,,,|| = l;s =!,...,(?, 

where X^-^s is an (m + 1) x 1 vector and 1^ is (m + 1) x g matrix. Consider 
the matrix of sums of squares and products of entries of X^^s given by 

1 " 

(4.8) Js = -Y.Xr,sXl,, 

which is a well-defined (m + 1) x {m + 1) matrix. Let ds{a) and gs{o) be the 
eigenvalues in increasing order and the corresponding unit eigenvector of Jg , 
a = l,...,m+l. Then the extrinsic sample mean in this case is 

(4.9) Yn,E = {[giim + 1)], . . . , [gg{m + 1)]). 

For q= I, Yn^E reduces to the mean given in (4.4), namely, gi{m + 1) = m. 
Arrange the pairs of indices (s, a), s = 1, . . . , g, a = 1, . . . , m, in their lexico- 
graphic order, and define the M x M symmetric matrix G by 

G{s,a),{t,b) 

(4.10) = n-\ds{m + 1) - ds{a))-\dtim + 1) - dt{b)y^ 

n 

X Y.{gJ{a)Xr,s){gI{b)Xr,t){gJim + l)Xr,sM {m + l)Xr,t). 
r=l 

It can be shown that G is a strongly consistent estimator of the covariance 
matrix of jk{Q) restricted to the tangent space of jfc((M-P™)'') at jk{f^)-, 
with respect to the orthobasis determined by the eigenvectors gs{a),s = 
l,...,q,a = l,...,m. 

Let Dg = {gs{l), ■ ■ ■ ,5s("^)), s = 1, . . . ,q, so that Dg is an (m + 1) x m 
matrix. If = ([71], . . . , [7^]), where 7^, s = 1, . . . ,q, are unit column vectors 
in M'""''^, we define a Hotelling type-statistic, 

(4.11) T\Y; Q) = ni^fD,,. ■ . Dg)G-HlI Di, . . -H^Dgf. 

Note that {-ffDi, -ygDg) is a row vector. A proof of Theorem 4.1 re- 
garding the asymptotic distribution of T{Y; Q) is available in a technical 
report of Mardia and Patrangenaru [19]; the report also examines Frechet's 
intrinsic mean. 

Theorem 4.1. Assume (y^),r = l,...,n, is a random sample from a 
probability measure Q on (MP"*)'' as above and for s = 1, . . . ,q, let Xs{a) and 7, 
be the eigenvalues in increasing order and corresponding unit eigenvectors 
of E[Xi^sXi^g] . If Xs{m + 1) > 0, s = 1, . . . ,q, are simple, then T'^{Y; Q) in 
(4.11) converges weakly to Xm- 
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The following result follows the technique of bootstrapping the distribu- 
tion of extrinsic sample means on a manifold [3] . 

Corollary 4.1. Let {Yr),r = 1, . . . ,n, be a random sample from Q on 
(MP™)"?, and let Yr = {[Xr,i], [Xr,q]), XjjXrj = l,j = l,...,q. Assume 
that Q has a nonzero absolutely continuous component. For a random resam- 
ple {Y{,. . . , Y*) from ("Ki, . . . , y„), denote the eigenvalues of ^ J2r=i ^r.s^rj 
in increasing order by d*{a),a = 1, . . . ,m + 1, and the corresponding unit 
eigenvectors by g*{a),a = 1, . . . , m + 1. Let G* be the matrix obtained from 
G, by substituting all the entries with *-entries. Then the bootstrap distri- 
bution function of the statistic 

T*\Y\Qn) = n{gj{m + l)Dl, . . .,gj{m + 

(4.12) 

X {gJim + l)Dl ,...,g^{m + l)D;f 

approximates the true distribution of T'^{Y;Q), given by (4.11), with an 
error of order Op{n~'^). 

In imaging applications, we may be given a template (population) mean in 
terms of k landmarks. The aim could then be to test for identification, that 
is, to assess if a given sample of images of a scene is from this population. 
Such a test can be based asymptotically on the test statistic given in (4.11). 
For small samples we need to use the bootstrapped statistic (4.12). 

We compare extrinsic means of projective shapes with respect to the em- 
bedding of -PS^ = (RP™")*^ in {S{m + 1))'^. Using our axial representation, 
we reduce this problem to axial statistics. Since for 6 = 1,2, Hb & (MP'")'', 
we set in axial representation fib = {fJ-b,!, ■ ■ ■ , lJ'b,g)- Without loss of gener- 
ality, we may assume that for each j the angle between /iij- and fi2,j is 
or less, and we consider the unique rotation pmj G S0{m + 1) such that 
— fi2,j and the restriction of pmj to the orthocomplement of the 
plane determined by fiij and H2,j in W^+^ is the identity. 

The equality pi = p2 is equivalent to pm,s = Lm+i,s = l,...,q, where 
/m.+i is an {m + 1) X {m + 1) identity matrix. Assume {Yi^r),i" = 1, • ■ . ,ni, 
(Y2^t),t = 1, • • • ) ^2, are random samples from Qi,Q2, respectively. A consis- 
tent estimator of the Lie group valued parameter pm = iPm,s,s = 1, . . . ,q) 
is rm = {rm,s,s = l,...,q), where, for each s = l,...,q, r^^s S SO{m + 1) 
is the unique rotation defined as above. This rotation brings the extrin- 
sic sample means (mean directions) in coincidence, that is, superimposes 
mi^s onto m2,s. Here ma,s is the unit eigenvector of ^21=1 Xr,asXj^g, where 

Ya^r = i[Xr,a,l]^ ■ • • ) [^r,a,q] ) , r = 1, . . .,ma, a =1,2. 
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Case for m = 2. A particular case of practical interest is when m = 2. 
We write here, for this particular case, p for p2 and r for r2. Here we will 
consider only the subcase k = 5, for which we give an application in the next 
section. To test the equality pi = p2 amounts to testing 

P = idj^p^ , 

where id^p2 is the identity map of in the group of isometrics /(MP^) 
of RP^. Any isometry p of MP^ can be represented in a unique way by a 
rotation T € 50(3) of the angle 6, with < 61 < TT,p{[x]) = [TX]. Note that 
a rotation T of the angle 6 = ir acts as the identity id^p2 . li p^ '''d^p2 and 
T € S0{3) represents p, there is an orthonormal basis Vi,V2,Vs of such 
that T{V3) = V3. We set 

Hip) = [iV^-TiV^),iV,xT{V,))f], 

for p^id^p2 and H{id^p2) = [1:0:0:0]. The map H:I{RP^) ^ MP^ is 
a well-defined isomorphism from /(MP^) to the axial space MP3. Modulo 
the diffeomorphism H, the equality pi = p2 amounts to H{p) = [1:0:0:0]. 
The distribution of the resulting consistent estimator H{r) of H{p) is es- 
sentially given in [1], Theorem 2.1. Assume neither ni nor n2 is small 
compared with n = rii + n2. Let G{p) be the affine coordinates of H{p); 
if H{p) = [Ho{p):Hi{p):H2{p):Hs{p)], then G{p) = G2(p), GsCp)), 

with Ga{p) = Ha{p)/Ho{p),a = 1,2,3. Using equation (5.13) of [1], page 489, 
it turns out that n^/^(r — p) has asymptotically a trivariate Gaussian distri- 
bution and is independent of n. Then by the delta method of Cramer (see, 
e.g., [7], page 45), it follows that, under the null hypothesis, if there are two 
constants ci,C2, such that n^/n — > c^, for 6= 1,2, then n^^'^{G{r) — G{p)} 
has asymptotically a trivariate Gaussian distribution which is independent 
of n. Consequently, if we consider the resamples under the empirical distri- 
bution n^/^{G(r*) — G{r)} by a nonpivotal bootstrap, then this distribution 
will have asymptotically the same distribution as that of n^/^{G(r) — G{p)}. 

Concentrated data case. For each projective coordinate from concen- 
trated projective shape data, we may select only one representative on the 
sphere. Therefore, we may use the directional representation on a product 
of q copies of S"". 

In this case, if Q is a concentrated distribution on (5'")'^, let pp, = 

(nrV: ■ ■ ■ , ttttt) be the mean multivariate direction and vn = (iMrw, • • • i nlw) 
^ IImiII ' ' mqW ' ^ ^ \\v W ' ' lly'll ' 

be the corresponding sample mean corresponding to a random sample (yi , . . . , y^) , 
Uj = (yj,...,?/j) G (5™)''. The asymptotic distribution of yo can be de- 
scribed in terms of an orthonormal frame field (ei^i(y^), . . . , ei^miv^), ■ ■ ■ , eg,i(y'^), . . . , eq^miv'^)) 
defined around po] here y = {y^ , . . . ,y'>) e (S'"')'' and for each a = l,...,q, ea,i{y°-Y ea,j{y"-) = = 1, 
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where ea,i{y^) € M™"*"^. Let G{y) be the M hy M matrix made of g by g 
square matrices of size m,Gabiy), where 

Ga,{y)=n~\\\r\\mr'lj:^[e.^^^ 

i,j = l,...,m. 

We studentize the tangential component of the difference between the sample 
and population directional means and obtain the following result. 

Theorem 4.2. Let yo - f^D = ELi T,JLi dajCaji-^^) + v he the de- 
composition of the difference between the directional sample mean and di- 
rectional mean into its tangential and normal components. If da = {daj)"^ , 
j = l,...,m, and d = {di, . . . ,dq)^ , then T'^{Y,Q, fin) = n ■ d'^G{y)~^d con- 
verges weakly to Xm- 

Remark 4.2. For concentrated data, Proposition 3.1 and Theorem 4.2 
show that asymptotically the squared norm of the Studentized sample mean 
and of the Studentized extrinsic sample mean vector both have a Xm distri- 
bution, where M is the dimension of the projective shape space. 

Corollary 4.2. A ssume Y]*, . . . , is a random resample with replace- 
ment of a random sample (Yi, . . . , Yn) from a probability measure Q on (5"")'^ 
such that fiD exists and Q has a unique absolutely continuous component 
with respect to the volume measure of (5'"^)'^. Let Qn be the empirical dis- 
tribution, and let d* ,G{y*) be the corresponding bootstrap analogs of d and 
G{y) obtained by substituting Y^ . . ,Y* for Yi, . . . , Yn, Y for fi, and Y* for 
Y. Then the distribution function ofT'^{Y,Q,fj,r,) can be approximated by the 
bootstrap distribution of T^{Y* ,Qn,YD) = nd*^G{Y*)-^d* with a coverage 
error of order Op{n~'^). 

Corollary 4.3. A (1 — a)100% bootstrap confidence region for i^io is 
given by the following: 

(a) Ra{Y) = {fi£{S'^)i\T^{Y,Q,fi)<Tf}, whereTf is the {1- a) 100% 
percentile of the bootstrap distribution T'^(Y* ,Qn,Y£)). 

(b) = {fi = (^1, ...,/.,)€ (s™)'?|r/(y,Q,/i,) < t*^^^, j = i,...,q}, 

where T*"^^^ is the (1 — ^)100% percentile of the bootstrap distributionTjiY* ^Qn^Yj^) 
corresponding to the jth directional component only. 

While Corollary 4.3(a) is useful when M = 1 or A/ = 2 (see Example 5.1), 
for larger values of M the computations are very intensive. To decrease the 
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computational complexity, one may use Corollary 4.3(b), which is based on 
a Bonferroni type of argument and gives the confidence region as a Cartesian 
product of confidence regions for the directional components of This is 
used in Example 5.3 below. 

5. Applications. To illustrate our methodology, we consider machine vi- 
sion applications involving data extracted from photographs. In the first 
two examples we use architectural features of two buildings, which form the 
preliminary stage of object recognition. In the third example we consider 
a problem in face recognition where two different views are available. We 
have assumed here that the coordinates of points are already recorded from 
a digital image; image processing software often has built-in procedures to 
extract coordinates of landmarks. There are various algorithms in practice 
to carry out this task (see, e.g., [12]). Our main aim is to show how we can 
use the one-sample and the two-sample tests when the underlying hypothe- 
ses are plausible. We show the type of computations required and indicate 
how the approximations work. We also examine how the parametric and the 
nonparametric tests perform. In Example 5.1 the problem is illustrated for 
m = l,A; = 4so q = l and involves circular statistics. In Example 5.2 m = 2, 
k = 5, whereas in Example 5.3 m = 2,k = 6, so q = 1 and q = 2, respectively, 
so that Example 5.2 involves the univariate spherical statistics, whereas Ex- 
ample 5.3 involves bivariate spherical statistics. These ideas can be extended 
to problems in machine vision such as in identification, classification and so 
on. Note that in these illustrative examples the number of landmarks k and 
the sample size n happened to be small, but the methods are applicable to 
any values of k and n. 

Example 5.1. In this example we selected randomly five photographic 
views of a building (the Education Building) from a large database of build- 
ings at the University of Leeds. One view is shown in Figure 5. It can be seen 
that the windows are equi-spaced. We selected four landmarks which are four 
central points of four consecutive windows (see Figure 5). The observed val- 
ues of these landmarks are given in Table 1. Note that here m = l,k = 4,q = 1 
and n = 5. 

We fix the projective frame vr = ([xi : 1], [x2 '■ 1], [x^ : 1]) and determine the 
cross-ratio c and projective coordinate of of p= [x4 : 1] given by (2.15). 
After doubling the angles, for each view, we get a direction 6. These values 
are shown in Table 1, together with the coordinates. 

If the landmarks are equidistant, their cross-ratio is c = 4/3 and the cor- 
responding direction is 6q = 1.287 rad. Therefore, testing the hypothesis for 
projective equidistance is equivalent to the problem 
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Education building: view 1 



Fig. 5. One view of the Education Building (University of Leeds) with four collinear 
points on four consecutive windows marked by white rings at the centre of the windows of 
the first floor. 



Table 1 

Horizontal coordinates of collinear points on the Education 
Building at 5 different views 



View 


Xi 


X2 


X3 


X4, 


c 







1 


22.90 


35.7 


48.3 


61.10 


1.340 


0.641 


1.282 


2 


23.10 


29.1 


35.5 


42.50 


1.338 


0.642 


1.284 


3 


41.40 


44.3 


47.3 


50.70 


1.353 


0.636 


1.273 


4 


39.00 


47.0 


53.9 


60.00 


1.337 


0.642 


1.285 


5 


42.25 


46.9 


50.5 


53.85 


1.373 


0.629 


1.259 



A parametric approach. Under the assumption of the von Mises distri- 
bution, as in [11], we consider the Watson- Wihiams [28] test statistic 

F(^) = {n-l){R- (cos^o, sm6o)K)/{n - R), 
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where R is the length of the resultant column vector R = X]r=i ("^^^ sm Oy-Y' 
Under Hq, for concentrated data F^^^ is approximately distributed as Fi n_i. 
For this data F^^) was found to be 2.826, which has p-value 0.168. Hence, 
we fail to reject the null hypothesis at the 5% significance level. 

Indeed, here = = 0.99994, so that data is highly concentrated. 
Hence, we can assume the tangent approximation of Section 3 is valid. Under 
this assumption the one sample f-test yields the value 0.170. So again we 
fail to reject Hq at the 5% significance level. 

A nonparametric approach. Assume now that the population distribu- 
tion Q is arbitrary and has a mean direction = exp(i^£)). We consider 
the hypotheses 

Ho:eD = 0o vs. Hi:0D^eo. 

Since the sample size n = 5 is very small, we base our p-value on Corol- 
lary 4.1. Using 5000 resample values of T'^{Y* ,Qn, Yd), we found the p- value 
to be 0.201. Thus, we again fail to reject Hq at the 5% significance level. 
In conclusion, we fail to reject the equidistance hypothesis at the 5% level 
using either test. 

Example 5.2. In this example we illustrate the two sample tests for 
m = 2 and A; = 5 so that q = 1. Again, we have used the Leeds Univer- 
sity Buildings database. In addition to the Education Building used in 
the previous example, we now consider an additional building — the Ca- 
reers Building. Two groups of identically positioned noncoUinear landmarks 
Ai,A2,A3,A4^, A^ were marked on five frontal photographs of the Education 
Building and four of the Careers Building, so that ni = 4 and n2 = 5. One 
of the buildings with the landmarks is shown in Figure 6. 

We obtain the spherical coordinates of the landmarks for the two samples 
following the calculations similar to those in Example 2.1, and these are 
given in Table 2. Assessing this part of architectural similarity between the 
two buildings is equivalent to performing a two-sample test for means. It is 
clear from our images that the architectural style of the windows based on 
these landmarks is very similar. We will also show that the Hotelling test 
based on projective invariants leads to a contradictory result, which indicates 
that we should prefer the use of the spherical projective coordinates. 

A parametric test. For the Education Building and for the Careers Build- 
ing, we find using Table 2 that the mean resultant lengths are 0.9997 and 
0.9979, respectively. Hence, the data are highly concentrated in projective 
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Careers building: view 1 



Fig. 6. One view of the Careers Buildings (University of Leeds) with five landmarks. 

shape space. Thus, we use Hotelhng's test in the tangent space given by 
(3.5). The combined mean direction and the mean resultant lengths are 

(0.7980, 0.5722, 0.1892), 0.9988, 

respectively. We find the value of F for Hotelhng's test is 2.6075, which 
is F2,6. In fact, Pr{F2fi > 2.6075) ~ 0.225 so that we fail to reject the nuh 
hypothesis. 

A nonparametric test. We selected the projective frame vr = ([^i : 1], [A2 : 1], [^3 : 1], [A4 : 1]) 
and determined the coordinates of the views in the sample, using a spher- 
ical representation; these spherical coordinates are displayed in Table 2. 



Table 2 

Spherical coordinates for the Education Building (five views) and the Careers Building 

( four views ) 



View 


Education Building 


View 




Careers Building 




1 


0.8142 


0.5547 


0.1718 


1 


0.7859 


0.5768 


0.2228 


2 


0.8038 


0.5610 


0.1977 


2 


0.8170 


0.5712 


0.0791 


3 


0.8067 


0.5591 


0.1917 


3 


0.7639 


0.6041 


0.2268 


4 


0.8150 


0.5513 


0.1787 


4 


0.7893 


0.5766 


0.2110 


5 


0.7773 


0.5890 


0.2211 
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Here the extrinsic sample mean projective shapes of views from the Edu- 
cation Building and Careers Building are given in the spherical represen- 
tation by Yi^E = [0.8037 : 0.5632 : 0.1922] and Y2,E = [0.7907 : 0.5834 : 0.1855] , 
respectively. Now consider the problem of estimating the distribution of the 
axis H{r) defined in Section 4. Since the smaller sample size is 4, and the 
eigenanalysis has to be repeated for each resample, we limited ourselves to 
250 pseudorandom resamples and determined the corresponding nonpivotal 
bootstrap distribution of G{r*). The corresponding distribution of 3G(r*) is 
displayed in Figure 7, which indicates that the sample mean of G(r) is close 
to (0,0,0). 

The rotation that brings Yi^e in coincidence with 1^2, E is identified with 
a 4-D axis (see Section 4), which turns out to be 



where we have used the dot product and cross-product of Yi^e and Y2^e- 
We determined the coordinates of the distribution of 3(G(r*) — G{r)) and, 
for this distribution, we successively sorted and trimmed the distribution of 
3G(r*) = {3Gi(r*),3G2(r*),3G3(r*)} and obtained the following 93% simul- 
taneous bootstrap confidence intervals: [—4.36,3.02] for3Gi(r*), [—3.59,2.67] 
for 3G2(r*), [-2.70,3.40] for 3G3(r*). This analysis shows that (0,0,0) is in 
the 93% percentile confidence region, that is, the identity is in the corre- 
sponding 93% bootstrap confidence region for p2 on 5*0(3). Therefore, we 
fail to reject /ii = /U2 at significance level a = 0.07. 



Fig. 7. Affine view of the bootstrapped distribution of size 250 of the nonpivotal vec- 
tor (3Gl{r*), 3G2(r*), 3G3{r*)) used in the comparison of the projective shapes of five 
landmarks on the Education and Careers Buildings. 



H{r) = [0.9997 : -0.0077 : 0.0029 : 0.0231] 




3G2(r) 
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A test based on invariants. Using Table 2, the projective invariants li 
and i2, defined at (2.16), are given below for the two buildings, respectively: 

Education Building: (4.739, 3.229), (4.068, 2.838), (4.208, 2.917), (4.561, 
3.083), (3.516, 2.664), 

Careers Building: (3.527, 2.588), (10.325, 7.219), (3.369, 2.664), (3.741, 2.733). 

We find that the observed value of F for the Hotelling test is 12.22 
and -P(-F2,4 > 12.22) = 0.0077. Thus, we reject the hypothesis of similarity 
between the two buildings. This conclusion is quite different than the one 
we get using projective spherical shape coordinates. Indeed, this aspect of 
the architecture is so similar that we should be accepting the hypothesis. 
The difference is explained by the fact that the data is not normal, and the 
test based on invariants is sensitive to departures from normality. 

Example 5.3. We now apply the method to a face recognition prob- 
lem. Figure 8(a) shows the seven frontal views (ni = 7) of the same person 
(an actor posing in different disguises) and Figure 8(b) shows his seven side 
views (n2 = 7). We recorded six landmarks (four corners of the eyes, "can- 
thus," and two end points of the lips, "mouth edge points"). Using the four 
eye-corner landmarks as the projective frame, the Cartesian landmarks were 
converted into the directional representation (bivariate spherical) , leading to 
the spherical projective coordinates xi and X2 in the same way as in Example 
2.1. This data is displayed in Table 3. 

A parametric test. First, it can be seen that, for frontal views, the mean 
resultant lengths of xi and X2 are 0.9995, 0.9955, respectively, whereas for 
the side views, the mean resultant lengths are 0.9995, 0.9996, respectively. 
These values imply that the data is highly concentrated. For the combined 
data, the respective mean resultant lengths are 0.9995, 0.9996. Thus, we 
could use the tangent space to test the hypothesis that the two means are 
equal (Section 3). We find that the combined mean directions are given by 

fif = (0.6889, 0.6735, 0.2681), = (0.7015, 0.6874, 0.1882). 

We calculated the tangent coordinates using (3.4), which, from (3.5), leads to 
the value of F = 0.8269; this has an F-distribution with degrees of freedom 4 
and 5. Since Pr{F4^g > 0.8269) = 0.5402, there is strong evidence that these 
frontal and side views are of the same person. 

A nonparametric test. The 95% pivotal bootstrap confidence region in 
the mean bivariate spherical direction based on Corollary 4.3(b), using 1500 
random resamples was found to be 2Yo.o25 = 3.21 and 72*o,o25 = 1-84 and the 
statistics {yD , y) {yo , y) obtained, using the seven side views, were 1.54 
and 1.33, respectively. It is seen that the bivariate mean projective shape 
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Table 3 

Bivariate spherical coordinates Xi — {x\,yl,zl) and X2 = {x2,y2, z^) corresponding to the 
14 Views (1-7 frontal, 8-14 side) of the actor 



View 


xl 


yl 


zl 


X2 


yi 


zl 


1 


0.700780 


0.657783 


0.276096 


0.708981 


0.676761 


0.198344 


2 


0.685337 


0.675546 


0.271939 


0.697420 


0.691293 


0.188996 


3 


0.688405 


0.650635 


0.320581 


0.709839 


0.669692 


0.218266 


4 


0.690658 


0.673332 


0.263846 


0.706231 


0.681266 


0.192651 


5 


0.691832 


0.668204 


0.273626 


0.700515 


0.685421 


0.198688 


6 


0.688246 


0.667378 


0.284470 


0.703869 


0.680057 


0.205158 


7 


0.681884 


0.685515 


0.255155 


0.692303 


0.697768 


0.183948 


8 


0.679369 


0.669555 


0.300255 


0.694591 


0.683580 


0.224191 


9 


0.686636 


0.687718 


0.235742 


0.698689 


0.696648 


0.162835 


10 


0.684002 


0.685272 


0.250087 


0.694651 


0.701190 


0.160603 


11 


0.667353 


0.699274 


0.256235 


0.679292 


0.713641 


0.171112 


12 


0.717523 


0.665701 


0.204944 


0.726679 


0.673698 


0.134419 


13 


0.694639 


0.669937 


0.262035 


0.705996 


0.686475 


0.174132 


14 


0.723910 


0.649280 


0.233215 


0.735734 


0.656621 


0.165965 



corresponding to the seven side views falls in the 95% pivotal bootstrap 
confidence region for the mean projective shape from frontal views. Thus, 
both methodologies lead to the same conclusion. 

Remark 5.1. We have used an edge registration method in developing 
the projective coordinates which are points in a curved space. The underlying 
projective shape space has the features of Kendall's shape space, since it is 
a symmetric space. For large samples, the choice of edge registration will 
have no influence on the analysis. For a similar discussion on the Bookstein 
shape coordinates, see [5], page 30. 

6. A strategy for general shape analysis. We now give a unified strategy 
underlying the three statistical shape spaces: similarity, affine and projective, 
leading to general statistical shape analysis. Each of these spaces is a space 
of orbits (orbifolds) of group actions on a finite set of points on a manifold. 
In general shape analysis, the group actions of interest are the following: 

1. In the case of similarity shape the group of direct similarities or, more 
generally, the group CO{m) x M™ of conformal linear maps of M"^, and 
the manifold is M™. 

2. In the case of affine shape, the group GL{m,M) x M™ of affine transfor- 
mations of M™' and the manifold is R™. 

3. In the case of projective shape, the group PGL{m,M.) of projective trans- 
formations of MP"^ and the manifold is MP™, or the pseudo-group action 
of PGL{m,R) on M™, regarded as the subset MP™ \ MP™"! of MP™. 
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Note that the groups are hierarchically ordered in these three types of shape. 

Let A; be a fixed positive integer, and let be the set of all configu- 
rations of k labelled points in (or in MP*"). Assume G is a group of 
transformations of (or of RP"^) that acts on the left on C^, via 

g{{xi,...,Xk)) = {g{xi),...,g{xk)), g G G,x = {xi,...,Xk) SM™. 

The orbit G{x) of such a configuration x = {xi, . . . ,Xk) € is defined by 
G{x) =: {g{x)\g € G}. The full G-shape space, GS^, is the set of all G-orbits, 
GTj^ = {G{x),x = {xi, . . . , Xk) G C^}- Note that the similarity planar shape 
space S2 (see [14]) is not a full shape space, since the orbit of x = (0, . . . , 0) 
is removed from the full CO{2) x M^-shape space. If this singular orbit is 
removed, the corresponding shape space has the structure of a manifold, 
namely, the complex projective space CP*^"^ = Sj^. 

The strategy used for similarity shape or affine shape can be extended to 
the general context by taking the following sequence of steps for any group 
action on whose orbits are closed: 

1. Identify with M*^™ via (xi, . . . ,Xk) {x\, . . . . . . . . . xj = 

(xj , . . . , x^) , j = 1, . . . ,k, and consider the topology on inherited from 
the Euclidean topology of M'^'". 

2. Consider the quotient topology on the orbit space GSj^ and let 7r:C^ 
GTj^ be the quotient map. A subset U of GS^ is open if ■K~^{pl) is open 
in C^. Recall that a subset V is generic if it is open and dense in the 
quotient topology. Note that if vr : GTI^ is the quotient map, then 

V is generic if 7r~^(V) is a generic subset in C^. Consider a generic subset 

V C GS^ that has a homogeneous structure (see [27]) or even a structure 
of symmetric space. 

3. Whenever possible, find generic shape spaces that admit homogeneous 
structures. In such a situation find an equivariant embedding of V into a 
Euclidean space which yields easily computable extrinsic sample means. 

4. Derive the distributions for marginal distributions on V resulting from 
noise at landmark locations, and if these distributions are intractable, 
approximate them with simpler distributions. 

5. Determine asymptotic distributions of Frechet sample means on V and, 
in particular, of extrinsic sample means. Then use associated statistics 
to design large sample confidence regions for population extrinsic means. 
For small samples derive corresponding bootstrap distributions for sample 
means and confidence regions. 

In this paper we introduced the projective shape space of configurations 
of points in general position following this strategy. In this case, given that 
the projective transformations in M™ form only a pseudogroup, it was more 
convenient to regard as an open affine set of MP'", and to consider 
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points in MP™' and the group G = PGL{m). We considered the generic set 
V C GTi{M.P^)^ of projective shapes of configurations of k points in MP™ 
with k>m-\-2 in general position, for which the first m + 2 of these points 
form a projective frame in MP™". This generic set V was cahed PS^ and 
we showed that PS^ is a manifold diffeomorphic with (MP™)'^, thus having 
a structure of homogeneous space. We embedded this manifold in a space 
of matrices, and computed extrinsic sample means and their asymptotic 
distributions and derived bootstrap results to deal with small sample sizes. 
In the case of linear projective shapes, we also approximated distributions on 
the space of projective shapes, resulting from noise at landmark locations, by 
distributions that are easier to handle, thus completing the general program 
presented above. In the case of planar similarity shape, for k> 2, the group 
of similarities acts freely on the space of configurations • In general, if the 
restriction of the action of the group G on to a generic subset of orbits 
is free, the dimension of each orbit is equal to the manifold dimension of the 
Lie group G. In this case one may locally select a submanifold of that is 
transverse to all the orbits, that is, dim(GS^) = km — dimG. Table 4 gives 
the dimensions of similarity, affine and projective shape spaces. Note that 
the number of degrees of freedom of the chi-square distributions needed for 
confidence regions of Frechet means is equal to the dimension of GS^ . 

Remark 6.1. Although less studied in statistics, projective shape anal- 
ysis is the most relevant in image analysis, since the pinhole camera principle 
is based on central projections. Affine shape analysis and similarity shape 
analysis are valid in image analysis only when such central projections can 
be approximated with parallel projections, or even orthogonal projections. 

Remark 6.2. Comparison of projective shapes is made easier due to 
homogeneity of the projective shape space. Recall that a space M is ho- 
mogeneous if there is a Lie group G of transformations such that, for any 
points xi, X2 € A4, there is a g € G with g{xi) = X2- In this way we may 
define a map from Mx A4 to G [which is what we did in Example 5.2, where 



Table 4 

The appropriate dimensions for different shape spaces with k points in a 

configuration 



Shape type 


Similarity 


Afflne 


Projective 


Group 


CO(m) K R'" 


Aff(m,R) 


PGL(m, R) 


Dimension 


m(m4-l) 1 -j^ 


m{m + 1) 


m{m + 2) 


Dimension of 


mk 1 


m{k — m — 1) 


m(fc — 771 — 2) 


shape space 









34 



K. V. MARDIA AND V. PATRANGENARU 



M = and G = 50(3)] and the comparison of two means is transferred 
on G. This method of comparison of projective shapes can be used to com- 
pare means of two populations on an arbitrary Riemannian homogeneous 
manifold and, in particular, on a Grassmanian manifold. Indeed, recently 
a need for population means on Grassmannian manifolds has arisen from 
signal processing; see [26]. 
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